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What is claimed is : 

1. A method for transforming and electronic document comprising the steps of: 
providing a visual representation of an original electronic document to a user; 
5 receiving feedback from interaction by said user with said visual representation; 

constructing one or more transformation rules using said feedback, said one or more 
extraction rules defining transformation of said electronic document; and 

applying said one or more extraction rules to said electronic document, a second electronic 
document or future instances of said original document to generate a virtual page of customized 
10 content. 

CI 2. The method of claim 1 wherein said electronic document is a Web page and said future 

^ '% 

1% instance is a more recent version of said Web page. 

I ff 15 3. The method of claim 1 wherein said feedback is used to generate: 
!! ;f one or more virtual tags, said virtual tags defining features of a portion of said original 

electronic document. 

4, The method of claim 3 further comprising the step of: 

1 3 20 storing said one or more virtual tags with said one or more transformation rules as a 

respective one or more virtual tag objects; and 

retrieving said one or more stored virtual tag objects from said virtual repository when 
subsequently accessing said electronic document, said stored one or more transformation rules 
being used to generate said virtual page. 

25 

5. The method of claim 3 wherein said one or more virtual tags and said one or more 
transformation rules are determined by the steps of 

a. selecting one or more document elements for inclusion or exclusion in said virtual page 
from said visual representation of said original electronic document using a graphical user 
30 interface; 
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b. identifying said selected document elements using features of a personal data content 
mining (PDCM) feature set and an intent of said user to include or exclude said document element 
in said virtual page; 

c. collecting said identified document elements into a set; and 

5 d, applying a classification algorithm to said set to classify said one or more document 

elements into a respective said one or more virtual tags and generate said one or more 
transformation rules. 

6. The method of claim 5 wherein after said step d. of applying a classification algorithm 
10 further comprising the steps of 

e. indicating said one or more virtual tags to said user at said visual representation; and 
f approving said indicated one or more virtual tags; or 

g. disapproving said indicated one or more virtual tags, wherein when said indicated one or 
more virtual tags are disapproved repeating step a, through step e. 

15 

1. The method of claim 5 wherein said original electronic document is an original Web page 
said PDCM feature set comprises element description space features, 

8. The method of claim 7 wherein said element description space features comprise one or 

20 more of the following features: 

bold, not bold, italic, not italic, underline, not underline, superscript, subscript, normal type, 
number of links encountered before which document element within a current nested structure, 
size of a font, foreground color, background color, font face, surrounding header level, 
immediately preceding header level, immediately preceding comment text, table body, header, 

25 footer, caption, not a caption, cascading style sheet class, beginning of the current nested structure, 
amount of preceding visual space, pattern of preceding visual breaks, number of preceding visual 
breaks, path through a nested structure of said original Web page, table row at a document 
structure depth, table column at a document structure depth, and item count at a document 
structure depth. 
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9. The method of claim 5 wherein said original electronic document is an original Web page 
said PDCM feature set comprises path feature space features. 

10. The method of claim 9 wherein said path feature space features comprise one or more of 
the following features: 

a sequence, number of line breaks in a sequence, number of table cells in one row in a 
sequence, number of table cells in one column in a sequence, relativized feature space attributes, 
and number of preceding visual breaks of an item list number at a document structure depth. 

11. The method of claim 5 further comprising the steps of 

determining stability of each of said features of said PDCM feature set; and 
selecting said features of said PDCM feature set having a highest stability in said step d. of 
applying said classification algorithm. 

12. The method of claim 3 wherein said original electronic document is an original Web page 
said one or more virtual tags and said one or more extraction rules are determined by: 

determining structural relationships of said original Web page to form a tree structure; 
selecting one or more structural objects from said visual presentation of said original Web 

page; 

selecting one or more document elements for inclusion or exclusion in said virtual Web 
page from said visual representation of said original Web page using a graphical user interface; 

identifying said selected document elements using features of personal data content mining 
a (PDCM) feature set and an intent of said user to include or exclude said document element in 
said virtual Web page; 

collecting said identified document elements into a set; 

applying a classification algorithm to said set to classify said one or more document 
elements into a respective said one or more virtual tags as one or more first virtual tags; 

determining one or more second virtual tags from said feedback and said one or more 
structural objects; 

associating said one or more second virtual tags to said tree structure; and 
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applying learning to associate said one or more first virtual tags to said one or more second 
virtual tags and to generate said one or more transformation rules. 

13. The method of claim 4 wherein said virtual page is generated by the steps of: 
5 h. building a tree structure of said original electronic document; 

i. selecting a leaf table of said tree structure; 

j. determining a plurality of ordering schemes for said virtual tag objects; 

matching said virtual tag objects to said ordering schemes for a leaf of said leaf table; 
I. selecting an ordering scheme for said leaf; 
10 m, replacing a parent leaf of said leaf table v^ith said selected ordering; and 

n. determining if a next leaf is a tree root, wherein if said next leaf is a tree root 
determining an outline from said ordering of virtual tags as a virtual page or if said next leaf is not 
a tree root repeating step h through step n, 

15 14. The method of claim 4 wherein said virtual page is generated by the steps of: 
0. selecting one of said one or more retrieved virtual tag objects; 
p. determining said one or more virtual tag objects associated with said selected retrieved 
virtual tag object; 

q. determining a relative path definition between said selected retrieved virtual tag object 
20 and said determined associated virtual tag objects; and 

r. determining if said relative path definition has been determined for all of said virtual tag 
objects, wherein if said relative path definition has been determined for all of said virtual tag 
objects, creating a virtual page fi-om said retrieved virtual tag objects and said relative path 
definition or if said relative path definition has not been determined for all of said virtual tag 
25 objects repeating step o. through step r. 

15. The method of claim 3 wherein a first said one or more virtual tags is a portion of said 
original electronic document to be cut and a second one of said one or more virtual tags is a 
portion of said electronic document to be pasted and said one or more transformation rules being 
30 constructed fi-om said first virtual tag and said second virtual tag for determining a cut and paste 
operation. 
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16. The method of claim 5 wherein said one or more transformation rules are applied to a more 
recent version of said original Web page. 

5 17. The method of claim 3 wherein said one or more virtual tags is a portion of a Web page to 
be formatted for determining a presentation of said virtual page. 

18. The method of claim 17 wherein said one or more transformation rules are applied to a 
more recent version of said original Web pages. 

10 

19. The method of claim 3 wherein said one or more virtual tags are generated by the steps of 
categorizing all elements of said original Web page as a plurality of cubes; 
determining assignment of said feedback to said cubes; and 

browsing said cubes to create said one or more virtual tags. 

15 

20. The method of claim 4 further comprising the steps of 

building a tree representation of said original electronic document; and 
determining a document scheme from said tree representation of said original electronic 
document and a tree structure of alternate versions of said original electronic document. 

20 

21 . The method of claim 20 further comprising the steps of 

determining if a more recent version of said original electronic document has a document 
scheme similar to said determined document scheme, 

wherein if said more recent version of said original electronic document has a document 
25 scheme similar to said determined document scheme, creating said virtual page from said retrieved 
one or more virtual tag objects and said most recent version of said original electronic document; 

if said subsequent original electronic document does not have a document scheme similar 
to said determined document scheme, revising said virtual tags and said one or more 
transformation rules; and 
30 creating said virtual page from said most recent version of said original electronic 

document with said revised one or more virtual tags and said revised one or more extraction rules. 
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22. The method of claim 21 wherein if said subsequent original electronic document does not 
have a document scheme similar to said determined document scheme, further comprising the step 
of: 

marking said virtual tags and said transformation rules as expired. 

23 . The method of claim 4 further comprising the steps of: 

monitoring consecutive instances of said stored one or more virtual tags; 

categorizing the type of said stored one or more virtual tag having consecutive instances 
with characteristics to form a definition for said one or more virtual tags; and 

matching related one or more virtual tags to form a virtual link in said virtual tag 
repository. 

24. The method of claim 4 wherein said virtual tag object is formatted as an extensible markup 
language (XML) view. 

25. The method of claim 1 wherein said visual representation is accessed with a graphical user 
interface. 

26. The method of claim 1 wherein said graphical user interface includes a proxy for 
monitoring actions of said user. 

27. The method of claim 3 further comprising the step of 

monitoring the one or more virtual tags and the one or more transformation rules. 

28. The method of claim 27 wherein said monitoring step provides microstatistics of the use of 
said one or more virtual tags and said one or more transformation rules. 

29. The method of claim 27 wherein said monitoring step provides monitoring of a user 
subscription to said one or more virtual tags or one or more transformation rules. 
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30. The method of claim 27 wherein said monitoring step further comprises the step of: 
sending a message to a content provider after creation of predefined one or more of said 

virtual tags. 

3 1 . The method of claim 3 wherein said one or more virtual tags previously constructed are 
used for constructing one or more new virtual tags and one or more new transformation rules. 

32. The method of claim 1 further comprising the step of 
linking said transformation rules to parameterized pages. 

33. The method of claim 1 wherein said original electronic document is a chapter of a book and 
said second electronic document is a plurality of chapters of said book. 

34. A system for transforming an electronic document comprising: 

means for providing a visual representation of an original electronic document to a user; 

means for receiving feedback from interaction by said user with said visual representation; 

means for constructing one or more transformation rules using said feedback, said one or 
more transformation rules defining transformation of said electronic document; and 

means for applying said one or more extraction rules to said electronic document, a second 
electronic document or fiiture instances of said original document to generate a virtual page of 
customized content. 

35. The system of claim 34 wherein said electronic document is a Web page and said future 
instance is a more recent version of said Web page. 

36. The system of claim 34 wherein said feedback is used to generate: 

one or more virtual tags, said virtual tags defining features of a portion of said original 
electronic document. 

37. The system of claim 36 fiirther comprising: 
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means for storing said one or more virtual tags with said one or more transformation rules 
as a respective one or more virtual tag objects in a virtual repository; and 

means for retrieving said one or more stored virtual tag objects from said virtual repository 
when subsequently accessing said electronic document, said stored one or more extraction rules 
5 being used to generate said virtual page. 



38. The system of claim 35 wherein said means for constructing one or more extraction rules 
comprises: 

means for selecting one or more document elements for inclusion or exclusion in said 
10 virtual page from said visual representation of said original electronic document using a graphical 
user interface; 

means for identifying said selected document elements using features of a personal data 
content mining (PDCM) feature set and an intent of said user to include or exclude said document 
element in said virtual page; 
15 means for collecting said identified document elements into a set; and 

means for applying a classification algorithm to said set to classify said one or more 
document elements into a respective said one or more virtual tags and generate said one or more 
transformation rules. 



20 39. The system of claim 38 further comprising: 

means for indicating said one or more virtual tags to said user at said visual representation; 

and 

means for approving said indicated one or more virtual tags; or 
means for disapproving said indicated one or more virtual tags, wherein when said 
25 indicated one or more virtual tags are disapproved determining revised one or more virtual tags 

and applying said means for indicated said one or more virtual tags to said user at said visual 

representation and means for approving said indicated virtual tags. 

40. The system of claim 38 wherein said electronic document is a Web page PDCM feature set 
30 comprises element description space features. 
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41 . The system of claim 40 wherein said element description space features comprise one or 
more of the following features: 

bold, not bold, italic, not italic, underiine, not underiine, superscript, subscript, normal type, 
number of links encountered before which document element within a current nested structure, 
size of a font, foreground color, background color, font face, surrounding header level, 
immediately preceding header level, immediately preceding comment text, table body, header, 
footer, caption, not a caption, cascading style sheet class, beginning of the current nested structure, 
amount of preceding visual space, pattern of preceding visual breaks, number of preceding visual 
breaks, path through a nested structure of said original Web page, table row at a document 
structure depth, table column at a document structure depth, and item count at a document 
structure depth. 

42. The system of claim 38 wherein said original electronic document is an original Web page 
said PDCM feature set comprises path feature space features. 

43 . The system of claim 42 wherein said path feature space features comprise one or more of 
the following features: 

a sequence, number of line breaks in a sequence, number of table cells in one row in a 
sequence, number of table cells in one column in a sequence, relativized feature space attributes, 
and number of preceding visual breaks of an item list number at a document structure depth. 

44. The system of claim 38 further comprising: 

means for determining stability of each of said features of said PDCM feature set; and 
means for selecting said features of said PDCM feature set having a highest stability in said 
means for applying said classification algorithm. 

45. The system of claim 36 wherein said original electronic document is an original Web page 
said one or more virtual tags and said one or more transformation rules are determined by: 

means for determining structural relationships of said original Web page to form a tree 
structure; 
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means for selecting one or more structural objects from said visual presentation of said 
original Web page; 

means for selecting one or more document elements for inclusion or exclusion in said 
virtual Web page from said visual representation of said original Web page using a graphical user 
interface; 

means for identifying said selected document elements using features of personal data 
content mining a (PDCM) feature set and an intent of said user to include or exclude said 
document element in said virtual Web page; 

means for collecting said identified document elements into a set; 

means for applying a classification algorithm to said set to classify said one or more 
document elements into a respective said one or more virtual tags as one or more first virtual tags; 

means for determining one or more second virtual tags from said feedback and said one or 

more structural objects; 

means for associating said one or more second virtual tags to said tree structure; and 
means for applying learning to associate said one or more first virtual tags to said one or 

more second virtual tags and to generate said one or more transformation rules. 

46, The system of claim 37 wherein said virtual page is generated by: 
means for building a tree structure of said original electronic document; 
means for selecting a leaf table of said tree structure; 

means for determining a plurality of ordering schemes for said virtual tag objects; 
means for matching said virtual tag objects to said ordering schemes for a leaf of said leaf 

table; 

means for selecting an ordering scheme for said leaf; 

means for replacing a parent leaf of said leaf table with said selected ordering; and 
means for determining if a next leaf is a tree root, wherein if said next leaf is a tree root 
determining an outline from said ordering of virtual tags as said virtual page. 

47. The system of claim 37 wherein said virtual page is generated by: 
means for selecting one of said one or more retrieved virtual tag objects; 
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means for determining said one or more virtual tag objects associated with said selected 
retrieved virtual tag object; 

means for determining a relative path definition between said selected retrieved virtual tag 
object and said determined associated virtual tag objects; and 
5 means for determining if said relative path definition has been determined for all of said 

virtual tag objects, wherein if said relative path definition has been determined for all of said 
virtual tag objects, creating a virtual page fi-om said retrieved virtual tag objects and said relative 
path definition. 

10 48. The system of claim 36 wherein a first said one or more virtual tags is a portion of said 
original electronic document to be cut and a second one of said one or more virtual tags is a 
portion of said electronic document to be pasted and said one or more transformation rules being 
constructed fi-om said first virtual tag and said second virtual tag for determining a cut and paste 
operation. 

15 

49. The system of claim 48 wherein said one or more transformation rules are applied to a 
more recent version of said original Web page. 

50. The system of claim 36 wherein said one or more virtual tags is a portion of a Web page to 
20 be formatted for determining a presentation of said virtual page. 

5 1 . The system of claim 50 wherein said one or more transformation rules are applied to a 
more recent version of said original Web page. 

25 52, The system of claim 36 wherein said one or more virtual tags are generated by: 

means for categorizing all elements of said original electronic document as a plurality of 

cubes; 

means for determining assignment of said feedback to said cubes; and 
means for browsing said cubes to create said one or more virtual tags. 



30 



1419-134 



53, The system of claim 37 comprising: 

means for building a tree representation of said original electronic document; and 
means for determining a document scheme from said tree representation of said original 
electronic document and a tree structure of alternate versions of said original electronic document. 

5 

54. The system of claim 53 further comprising: 

means for determining if a more recent version of said original electronic document has a 
document scheme similar to said determined document scheme, 

means for creating said virtual page from said retrieved one or more virtual tag objects and 
10 said most recent version of said original electronic document, if said more recent version of said 
original electronic document has a document scheme similar to said determined document scheme; 
or 

; i means for revising said virtual tags and said one or more transformation rules, if said 

subsequent original electronic document does not have a document scheme similar to said 
[1 15 determined document scheme; and 

means for creating said virtual page from said most recent version of said original 
m electronic document with said revised one or more virtual tags and said revised one or more 

L^^ extraction rules, 

1(1 20 55. The system of claim 54 further comprising: 

li means for marking said virtual tags and said transformation rules as expired, if said 

subsequent original electronic document does not have a document scheme similar to said 
determined document scheme. 

25 56. The system of claim 37 further comprising: 

means for monitoring consecutive instances of said stored one or more virtual tags; 

means for categorizing the type of said stored one or more virtual tags having consecutive 
instances with characteristics to form a definition for said one or more virtual tags; and 

means for matching related one or more virtual tags to form a virtual link in said virtual tag 
30 repository. 
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57. The system of claim 37 wherein said virtual tag object is formatted as an extensible markxip 
language (XML) view, 

58. The system of claim 34 wherein said visual representation is accessed with a graphical user 
5 interface. 

59. The system of claim 34 wherein said graphical user interface includes a proxy for 
monitoring actions of said user. 

10 60. The system of claim 37 further comprising: 

means for monitoring the one or more virtual tags and the one or more transformation 

rules. 

61. The system of claim 60 wherein said monitoring means provides microstatistics of the use 
15 of said one or more virtual tags and said one or more transformation rules. 

62. The system of claim 60 wherein said monitoring means provides monitoring of a user 
subscription to said one or more virtual tags or one or more transformation rules. 

20 63. The system of claim 60 wherein said monitoring means further comprises means for 
sending a message to a content provider after creation of predefined one or more of said virtual 
tags. 

64, The system of claim 37 wherein said one or more virtual tags previously constructed are 
25 used for constructing one or more new virtual tags and one or more new transformation rules. 

65. The system of claim 34 ftirther comprising: 

means for linking said transformation rules to parameterized pages. 

30 66. The system of claim 34 wherein said original electronic document is a chapter of a book 
and said second electronic document is a plurality of chapters of said book. 
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67. A computer program product for transforming an electronic document comprising: 
means for providing a visual representation of an original electronic document to a user; 
means for receiving feedback from interaction by the user with the visual representation; 

5 means for constructing one or more transformation rules using said feedback, said 

transformation rules defining customization of electronic document; and 

means for applying said one or more extraction rules to said electronic document, a second 
electronic document or fiiture instances of said original document to generate a virtual page of 
customized content. 

10 

68. The computer program product of claim 67 wherein said electronic document is a Web 
page and said future instance is a more recent version of said original Web page. 

69. The computer program product of claim 67 wherein said feedback is used to generate: 
15 one or more virtual tags, said virtual tags defining features of a portion of said original 

electronic document, 

70. The computer program product of claim 69 fiirther comprising: 

means for storing said one or more virtual tags with said one or more transformation rules 
20 as a respective one or more virtual tag objects in a virtual repository; and 

means for retrieving said one or more stored virtual tag objects from said virtual repository 
when subsequently accessing said electronic document, said stored one or more transformation 
rules being used to generate said virtual page. 

25 71 . The computer program product of claim 68 wherein said means for constructing one or 
more extraction rules comprises: 

means for selecting one or more document elements for inclusion or exclusion in said 
virtual page from said visual representation of said original electronic document using a graphical 
user interface; 
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means for identifying said selected document elements using features of a personal data 
content mining (PDCM) feature set and an intent of said user to include or exclude said document 
element in said virtual page; 

means for collecting said identified document elements into a set; and 
5 means for applying a classification algorithm to said set to classify said one or more 

document elements into a respective said one or more virtual tags and generate said one or more 
transformation rules. 

72. The computer program product of claim 7 1 further comprising: 

10 means for indicating said one or more virtual tags to said user at said visual representation; 

and 

means for approving said indicated one or more virtual tags; or 

means for disapproving said indicated one or more virtual tags, wherein when said 

indicated one or more virtual tags are disapproved determining revised one or more virtual tags 
15 and applying said means for indicated said one or more virtual tags to said user at said visual 

representation and means for approving said indicated virtual tags. 

73 . The computer program product of claim 7 1 wherein said original electronic document is an 
original Web page said PDCM feature set comprises element description space features. 

20 

74. The computer program product of claim 73 wherein said element description space features 
comprise one or more of the following features: 

bold, not bold, italic, not italic, underline, not underline, superscript, subscript, normal type, 
number of links encountered before which document element within a current nested structure, 

25 size of a font, foreground color, background color, font face, surrounding header level, 

immediately preceding header level, immediately preceding comment text, table body, header, 
footer, caption, not a caption, cascading style sheet class, beginning of the current nested structure, 
amount of preceding visual space, pattern of preceding visual breaks, number of preceding visual 
breaks, path through a nested structure of said original Web page, table row at a document 

30 structure depth, table column at a document structure depth, and item count at a document 
structure depth. 
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75 . The computer program product of claim 7 1 wherein said original electronic document is an 
original Web page said PDCM feature set comprises path feature space features. 

76. The computer program product of claim 7 5 wherein said path feature space features 
comprise one or more of the following features: 

a sequence, number of line breaks in a sequence, number of table cells in one row in a 
sequence, number of table cells in one column in a sequence, relativized feature space attributes, 
and number of preceding visual breaks of an item list number at a document structure depth. 

77. The computer program product of claim 71 further comprising: 

means for determining stability of each of said features of said PDCM feature set; and 
means for selecting said features of said PDCM feature set having a highest stability in said 
means for applying said classification algorithm. 

78. The computer program product of claim 69 wherein said original electronic document is an 
original Web page said one or more virtual tags and said one or more extraction rules are 
determined by: 

means for determining structural relationships of said original Web page to form a tree 
structure; 

means for selecting one or more structural objects from said visual presentation of said 
original Web page; 

means for selecting one or more document elements for inclusion or exclusion in said 
virtual Web page from said visual representation of said original Web page using a graphical user 
interface; 

means for identifying said selected document elements using features of personal data 
content mining a (PDCM) feature set and an intent of said user to include or exclude said 
document element in said virtual Web page; 

means for collecting said identified document elements into a set; 

means for applying a classification algorithm to said set to classify said one or more 
document elements into a respective said one or more virtual tags as one or more first virtual tags; 
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means for determining one or more second virtual tags from said feedback and said one or 
more structural objects; 

means for associating said one or more second virtual tags to said tree structure; and 
means for applying learning to associate said one or more first virtual tags to said one or 
5 more second virtual tags and to generate said one or more transformation rules. 

79. The computer program product of claim 70 wherein said virtual page is generated by: 
means for building a tree structure of said original electronic document; 

means for selecting a leaf table of said tree structure; 
10 means for determining a plurality of ordering schemes for said virtual tag objects; 

means for matching said virtual tag objects to said ordering schemes for a leaf of said leaf 

table; 

means for selecting an ordering scheme for said leaf; 

means for replacing a parent leaf of said leaf table with said selected ordering; and 
15 means for determining if a next leaf is a tree root, wherein if said next leaf is a tree root 

determining an outline from said ordering of virtual tags as said virtual page, 

80. The computer program product of claim 70 wherein said virtual page is generated by: 
means for selecting one of said one or more retrieved virtual tag objects; 

20 means for determining said one or more virtual tag objects associated with said selected 

retrieved virtual tag object; 

means for determining a relative path definition between said selected retrieved virtual tag 
object and said determined associated virtual tag objects; and 

means for determining if said relative path definition has been determined for all of said 
25 virtual tag objects, wherein if said relative path definition has been determined for all of said 

virtual tag objects, creating a virtual page from said retrieved virtual tag objects and said relative 
path definition. 

81. The computer program product of claim 70 wherein a first said one or more virtual tags is a 
30 portion of said original electronic document to be cut and a second one of said one or more virtual 

tags is a portion of said electronic document to be pasted and said one or more transformation rules 
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being constructed from said first virtual tag and said second virtual tag for determining a cut and 
paste operation. 

82, The computer program product of claim 81 wherein said one or more transformation rules 
5 are applied to a more recent version of said original Web page, 

83 . The computer program product of claim 70 wherein said one or more virtual tags is a 
portion of a Web page to be formatted for determining a presentation of said virtual page. 

10 84. The computer program product of claim 83 wherein said one or more transformation rules 
are applied to a more recent version of said original Web page. 

85. The computer program product of claim 70 wherein said one or more virtual tags are 
generated by: 

15 means for categorizing all elements of said original electronic document as a plurality of 

cubes; 

means for determining assignment of said feedback to said cubes; and 
means for browsing said cubes to create said one or more virtual tags, 

20 86. The computer program product of claim 70 comprising: 

means for building a tree representation of said original electronic document; and 
means for determining a document scheme from said tree representation of said original 
electronic document and a tree structure of ahernate versions of said original electronic document, 

25 87. The computer program product of claim 86 further comprising: 

means for determining if a more recent version of said original electronic document has a 
document scheme similar to said determined document scheme, 

means for creating said virtual page from said retrieved one or more virtual tag objects and 
said most recent version of said original electronic document, if said more recent version of said 
30 original electronic document has a document scheme similar to said determined document scheme; 
or 
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means for revising said virtual tags and said one or more transformation rules, if said 
subsequent original electronic document does not have a document scheme similar to said 
determined document scheme; and 

means for creating said virtual page from said most recent version of said original 
5 electronic document with said revised one or more virtual tags and said revised one or more 
extraction rules. 

88. The computer program product of claim 87 further comprising: 

means for marking said virtual tags and said transformation rules as expired if said 
10 subsequent original electronic document does not have a document scheme similar to said 
determined document scheme. 

89. The computer program product of claim 70 further comprising: 

means for monitoring consecutive instances of said stored one or more virtual tags; 
15 means for categorizing the type of said stored one or more virtual tags having consecutive 

instances with characteristics to form a definition for said one or more virtual tags; and 

means for matching related one or more virtual tags to form a virtual link in said virtual tag 
repository. 

20 90. The computer program product of claim 70 wherein said virtual tag object is formatted as 
an extensible markup language (XML) view. 

91 . The computer program product of claim 67 wherein said visual representation is accessed 
with a graphical user interface. 

25 

92. The computer program product of claim 67 wherein said graphical user interface includes a 
proxy for monitoring actions of said user. 

93. The computer program product of claim 70 further comprising: 

30 means for monitoring the one or more virtual tags and the one or more transformation 

rules. 
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94. The computer program product of claim 93 wherein said monitoring means provides 
microstatistics of the use of said one or more virtual tags and said one or more transformation 
rules. 

5 

95. The computer program product of claim 93 wherein said monitoring means provides 
monitoring of a user subscription to said one or more virtual tags or one or more transformation 
rules. 

10 96. The computer program product of claim 93 wherein said monitoring means fiirther 

comprises means for sending a message to a content provider after creation of predefined one or 
more of said virtual tags. 

97. The system of claim 70 wherein said one or more virtual tags previously constructed are 
15 used for constructing one or more new virtual tags and one or more new transformation rules. 

98. The computer program product of claim 67 further comprising: 
means for linking said transformation rules to parameterized pages. 

20 99. The computer program product of claim 67 wherein said original electronic document is 
chapter of a book and said second electronic document is a plurality of chapters of said book. 
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